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asymptotics under the null-hypothesis is described. Several examples of such statistics used for 
pH I testing goodness-of-fit and symmetry are considered. It is shown how to calculate their local 

Bahadur efficiency. 
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PLh 1 Introduction. 

! Let Xi,X2,... be i.i.d. observations with continuous distribution function (df) F. 

Denote by -F„ the usual empirical df (edf) based on the first n observations. The 
Kolmogorov statistic is the distribution-free statistic defined by 

> 

OO: Dn = snp \Fn{t)-Fit)\. (1) 

O 



O 

o 



In order to describe the large deviation asymptotics of statistic dH) consider for 
< a < 1 the function 



^ , CL + 1 ,^ . , 1 — a — t 
(a + t) In -f (1 - a - t) In , < t < 1 - a . 



OO, 1 — a<t<l 



and put 

/o(a) = ^inf^ /(a,t) . 

The following theorem was proved in [T] , see also and [5] . 
Theorem 1.1. For any a G (0, 1) we have 

lim n"^ \n¥{Dn > a) = -fo{a), 
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where the function fo{a) is continuous on (0, 1), and as a ^ 

/o(a) = 2a2(l + o(l)). 

In particular, this result can be used to calculate Bahadur efficiency of various 
modifications of Kolmogorov-Smirnov tests, see [2] and 

We are interested in [/-empirical generalizations of Theorem 1.1 and their appli- 
cations to the problem of testing goodness-of-fit and symmetry. Let h{xi, x^) be 
a real- valued symmetric kernel of degree m > 1. Consider the [/-empirical df (udf) 

Gn{t)=('') V i{h{x,^,...,x,J<t}, teR\ 

^ ^ l<ii<...<i„i<n 

The properties of such udf's and their use in Statistics were studied in [H O El [7] . 
Also denote 

G(t)=P(/i(Xi,...,X„)<t) (2) 

and assume that this df is continuous and (for simplicity) strictly monotonic. Then 
the [/-empirical analog of the Kolmogorov statistic ([T]) has the form 

D[/„ = sup|G„(t)-G'(t)| 
t 

and coincides with the Kolmogorov statistic when m = 1 and h{t) = t. 

Various tests of goodness-of-fit and symmetry can be constructed utilizing the 
Kolmogorov- S mir nov-typ e st at ist ic 

SUn = snp\Gn{t)-Fn{t)\, (3) 
t 

with various choices of the kernel h. Statistics of this type frequently appear in 
constructing statistical tests based on characterizations of distributions; then it is 
assumed that G{t) = F{t), see El [lOl E] . 

The next example illustrates the idea of building such tests. Consider scale- 
free testing of exponentiality based on Desu's characterization [12]: Let Xi and 
X2 be independent non-degenerate and non-negative rv's with common df F. Then 
the rv's 2min(Xi,X2) and Xi have the same distribution if and only if F{x) = 
1 — exp(— Ax), X > for some A > 0. 

Let us compare the [/-empirical df 

Gn{t) = ^{2min(X„X,) < t} 

^<j<k<n 
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with the usual edf by considering the statistic 



DE, = sup|G„(t)-F,(t)| 



and assuming its large values to be critical. The limiting distribution of this statis- 
tic can be found using the results of [13] where weak convergence of U- empirical 
processes is studied, see also [H] and [1]. Critical values of DEn can be calculated 
via simulation. 

The logarithmic large deviation asymptotics of DEn enables to calculate its Ba- 
hadur efficiency and compare it with other statistics used for testing exponentiality, 
see [21 IHl [15] . Various examples of similar statistics will be given at the end of this 
paper. 

The problem of large deviation asymptotics for [/-statistics is studied insuffi- 
ciently. The abstract large deviation principle stated in [I6l and [7| is non-effective 
for statistical applications. The point is that the rate of decrease for large deviation 
probabilities is given implicitly, as a value of complicated extremal problem in which 
the KuUback-Leibler information is minimized on an involved set of probability mea- 
sures. The result that could actually be apphed to the efficiency calculations was 
first obtained for bounded kernels in [18]. This result is stated as follows. 

Theorem 1.2. Consider the U -statistic of degree m>l 



with centred, hounded, and non- degenerate real-valued kernel $, so that 

E$(a;i,...,X„) =0, < M, 

and (7^ = Eip'^{Xi) > with ip{y) = E{^{Xi, ...,Xm)\Xi = y). Then we have 



where the series converges for sufficiently small a > and 62 = {2m?a'^)~^. 

The aim of the present paper is to extend Theorem 1.1 (using Theorem 1.2) to 
the case of statistics DUn, SUn, their one-sided variants, and some other similar 
statistics. In the last section we show how to use the large deviation results for 
calculating local Bahadur efficiencies. 




00 



lim n"^ lnP(f/„ > a) := -^(a|$) = - V bja^ , 

i—>oo ' ^ 

i=2 



(4) 
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2 Auxiliary arguments and formulation of results 

To study large deviations of Kolmogorov-type statistics, it is natural to begin with 
a simple one-sided statistics, see [1], [2], 

DU+ = snp[Gnit) - Git)], DU- = sup[G(t) - Gnit)]. 

t t 

For any e > denote P„(£) = ¥{DUn > e), and let P+(£) and P-{£) be the 
corresponding probabilities for DU^ and DU^ . It is evident that 

max(P+(£), p-{e)) < Pr^e) < 2 max(P+(£), p-(£)). (5) 

Therefore, if we prove that for some functions and g~ 

lim n~^\\iP^[e) = ~g^{e), lim n~MnP~(e) = —g~{e), 

n—*oo n^oo 

then, by ([5]), we obtain 

lim n"MnP„(e) = — m.m{g^ (e) , g^ (e)) . (6) 

The above argument is also valid for the statistic SUn when we use one-sided 
statistics SU^ and SU~ . For this reason, below we consider large deviations of 
one-sided test statistics only. 

Let us impose some restrictions on the kernel. We see that the statistic DU^ is, 
in fact, the supremum of the family of [/-statistics with the kernels 

Q{xi,...,xm;t) = I{h{xi,...,xm) <t} -G{t), teR\ (7) 

depending on t. The same is true for the statistic SU^, where the kernels also depend 
on t and have the form 

m 

'^{xu...,Xm;t) = I{h{xi,...,Xm) <t} -m~^^I{xi <t}, teR\ (8) 

i=l 

For any t the kernels 0(- ; t) and ]t) are centred and bounded. Consider their 
projections 

e{s;t) = E(e(Xi,...,X^;t)|Xi = s), tlj{s;t) = X^; t)|Xi = s), 

and the variance functions 

alit) = Ee\Xv, t), = Eij\X,- 1). 

4 



In the sequel, we will work with general families of kernels $(■ ;t G T), that 
comprise, as particular cases, the families 9(- ; t) and \E'(- ; t) defined by ([7]) and 
(IE]). For definiteness, we assume that the parameter set T is some finite or infinite 
interval [a, b] of the real line. 

Most results in the literature related to large deviations of U-statistics deal with 
U-statistics with specific kernels, see, for example, [TTl [16], [HI [19]. Unlike these 
results, we study whole families of kernels and the corresponding families of U- 
statistics indexed by real parameter t. This is a distinctive feature of the paper. 

We say that the family of [/-statistics {f/„(t),t G [a,b]} with the kernels $(■ ; t) 
and projections ip{- ; t) is non- degenerate, if its variance function cr'^it) = Eip'^{Xi; t) 
can vanish only at the ends of the interval [a, b] and at a finite number of points in 
the interior of this interval. 

For example, the one-sided variant DE^ of Desu's statistic ([T]) has the variance 
function equal to |exp(— — exp(— > 0. Another typical variance function 
that appears below in Example 5 is equal to — |t|),— 1 < t < 1. Both families 
of [/-statistics are non-degenerate according to our definition. 

Denote by v?o the maximum of the variance function, i.e., ipl = supiCr^(t) > 
and introduce the following subset of the parameter set : 

T = {t\te[a,b], aj(t)>^^2|_ 

On this set the variance function is separated from zero. Denote by the comple- 
ment of T in [a,b]. 

Consider the family of [/-statistics {Un(t),t G [a, b] } corresponding to the family 
of kernels {$(• ; t)}. For t (zT the kernels are non-degenerate, centred, and bounded. 
Then the application of Theorem 1.2 yields relation (j4]) with the function of two 
arguments 

oo 

g{a,t\^) = Y,hity (9) 
i=2 

where the coefficients bj{t) depend on t unlike the function g{a\^). Put 

gT{a\^) :=inf^7(a,t|$). (10) 

It turns out that this function determines the large deviation asymptotics of the 
statistics under consideration, and that the behavior of the function t g{a,t\<^) 
on the set is of no importance. The set T can be somewhat enlarged or narrowed 
without essential changes in the results. 
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As usually in large deviation theory, deriving the large deviation asymptotics 
will consist in obtaining upper bound and lower bound. Asymptotic coincidence of 
the bounds would give the desired result. We begin with the lower bound. 

Theorem 2.1. Suppose that the non-degenerate family of kernels {$(xi, Xm', t), 
t G [a, b]} is bounded and centred for all t. Then for a > we have 

liminf n-MnP(supf/„(t) > a) > -gT{a\^). (11) 

n— >oo ^ 

The function a — > gT{a\^) is continuous for sufficiently small a and admits the 
representation 

^'^^'"'^^^ 2^;^ + ^^'''^' ^^^^ 

From this theorem the lower bound on the large deviation probabilities for U- 
empirical Kolmogorov-Smirnov tests follows. Both one-sided statistics DU^ and 
DU~ satisfy the conditions of Theorem 2.1. They correspond to different families of 
kernels 0(-; ^) and — ©(■; t) with common variance function crg(t). The corresponding 
functions gT{a\Q) and gT{0'\ — ©) are also different, but the first terms of their 
asymptotic expansions as a — > are the same. Now we obtain from ( fTTl) 



liminf n"^ lnF{DU^ > a) > -gM ± 6), (13) 

n— >oo 

and similarly 

liminf n"^ \n¥{SU^ > a) > -gria] ± ^). (14) 

For the purpose of obtaining the upper bound, we assume that the following 
condition of monotonicity in parameter on the family {Un{t),t G is satisfied. 

Suppose there exists a sequence of partitions of the interval [a, b] into parts: 
a = to < ti < ■ ■ ■ < t^ = b, such that the nodes of the partition do not coincide 
with the zeros of the variance function and that for any = 0, . . . , A^ — 1 

sup Unit) < Un{tk+i) + A„(Ar), (15) 

where the sequence of rv's A„(A^) decreases fast, when n and A^ grow. More precisely, 
it means that there exists a sequence {tn}, Tjy ^ as N ^ oo, such that 

lim lim n~MnP(A„(A^) > tn) = -oo. (16) 

N—>oo n— »cxD 

Under this condition the upper bound result takes the following form. 



6 



Theorem 2.2. Suppose that the conditions of Theorem 2.1 are valid and that 
the condition of monotonicity in parameter of the family {Un{t)} holds. Then we 
have 

limsup n"MnP(supf/„(t) > a) < -gT{a\^). (17) 

n— >oo t 

Combining Theorems 2.1 and 2.2, we arrive at the foUowing resuh. 
Theorem 2.3. Under the conditions of Theorem 2.2 we have 

hm n"MnP(sup [/„(t) > a) = -grial^). 

n~*oo ^ 

It is shown below that both statistics DU^ and SU^ satisfy the monotonicity 
condition. If the corresponding families of kernels ([7]) and ([8]) are non-degenerate 
and centred, then Theorem 2.1 is applicable. Hence ([5]) and ([6]) imply the following 
result for the two-sided statistics. 

Theorem 2.4. // the family of kernels (JTj) is non- degenerate, then 

lim n"^ InW^DUn > a) = f_D(a) = — min(gT{a\Q) , gT{a\ — 0)). 
// the family of kernels is non-degenerate and centred, then 

lim n~^\nF{SUn > a) = Vsia) = - mm{gTia\'^), gria] - ^')), 
The functions vu and vs are continuous and satisfy the asymptotic relations 



3 Lower bound. 

In order to prove the lower bound we need the theorem on implicit analytic operators. 
Consider three Banach spaces Ei, E2, and E3. Denote by Vr{xo,E) the ball in the 
space E of radius r with center at xq. We are interested in finding solutions x = x{y) 
of the operator equation 

F{x,y) = (18) 
(here y plays a part of a parameter) under the condition 

xiyo) = xq. (19) 
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We assume that the operator F{x,y) is analytic and the condition F{xo,yo) = 
holds. For main definitions and facts of the theory of analytic operators in Banach 
spaces, we refer to [20l §22] and [211 §32]. In particular, the operator F{x,y) 
is called analytic in some domain if in the neighbourhood of any point it can be 
represented as an uniformly convergent Taylor operator series |20] . 

Theorem 3.1. see f2^ Theorem 22.2 ]. Suppose that the operator F{x,y) is 
analytic in Vr{xo, Ei) xV^{yo, E2) with values in E^. Let the operator B = F^{xo, yo) 
(the derivative is understood in the Frechet sense) has a bounded inverse operator. 
Then there exist positive numbers ri and pi such that the equation ( |T8|) has an unique 
solution X = f{y) in the ball V^^^Xq, Ei) . This solution is defined and analytic in 
the ball Vp-^{yQ, E2), and satisfies the condition (fT9|) . 

We will prove Theorem 3.1 using the arguments of [H] in conjunction with some 
auxiliary results. The main idea is to construct a majorant series for solutions of 
nonlinear equations that would guarantee the uniform convergence with respect to 
the parameter t. 

Proof of Theorem 3.1. It is clear that for any statistic of the form sup(f/n(t), 
with kernels $(■ ; t) and projections ip{- ;t), the following holds true: 

P( supf/„(t) > a) > supP(f/„(t) > a). 

t teT 

Consequently, 

liminf n~MnP(sup f/„(t) > a) > - inf g{a,t\(^) = -gT{a\(^). 

Hence in order to obtain the lower bound we must analyze the function g{a,t\^) in 
(E]) over the set T. Recall that for t G T we have a^it) > ^(fl > 0. 

For any Borel set S G consider the Banach space of measurable bounded 
functions B{S) with the norm 

II X 11= SUp,g5|x(s)| . 

We may assume that the initial observations Xi,i = l,...,n, are uniformly 
distributed on J = [0,1]. Otherwise we can consider the sample Ui = F{Xi),i = 
1, . . . , n, and replace the initial kernel h{Xi, Xm) by h{F~^{Ui), . . . , F^^iUm))- 
The families of kernels G(- ; t) and the corresponding families of 

[/-statistics depending on Ui remain centred, bounded, and non-degenerate. 

For simplicity we consider only the kernels of degree 2. With slight changes in 
the proof, the results remain valid for kernels of any finite degree. 
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For any t G T we use the variant of Sanov's theorem for large deviations of 
[/-empirical measures from [l6j, see also [7]. So, we reduce the problem of large de- 
viations to the problem of the minimization of entropy under suitable normalization 
conditions. It follows from [18] that the function of interest g{a,t\^) is the solution 
of the extremal problem 

g{a, = inf { / (1 + ax(s; t)) ln(l + ax{s; t))ds : x e B{I xT), f x{s] t)ds = 0} 
Jo Jo 

(20) 

under the normalization condition 

{p{si]t)x{si]t)dsi + / / ^{si,S2;t)x{si;t)x{s2;t)dsids2 = I. (21) 
Jo Jo 

The Euler-Lagrange equation for the extremal x{s;t) from B{I x T) takes the 
form, see 281 : 



(1 + ax{si; t)) /g exp |A(t)a $(si, S2] t){l + ax{s2; t))ds2j dsi- 
-exp <^ \{t)a /g $(si, S2] t)(l + ax{s2] t))ds2 r = 0, 

under the same normalization condition (12T]) . To simplify the notations denote 

p{x\si]t) = I ^{si,S2]t)x{s2]t)dS2. 



Then (]22l) can be written as 

(1 + ax{si] t)) exp {\{t)aip{si] t) + \{t)a? p{x\si] t)) dsi 



(23) 

exp {\{t)aLp{si] t) + \{t)o?' p{x\si] t)) = 0, 



and the condition fl2Tl) becomes 



/ {2ip{si]t) + ap(x\si;t))x{si;t)dsi = 1. (24) 
Jo 

Expanding the exponents on the left-hand side of fl23l) into a series and integrat- 
ing, we get 

E.+,>o A^+^(t)a^+2^(A;!j!)-i v^{s,; t)p{x\s,; t)ds,- 

E.+,>o \'^KtV^'Km~'v\s,- t)p{x\s,- 1) + 

Ek+j>o X'^'{t)a'-'''-'\m-'x{s,; t) J,' ^'^{s,; t)p{x\s,; t)ds, = 0. 
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To simplify this equation we extract two first terms corresponding to the indices 
k = 0, j = and k = 1, j = 0, so that the remainder of the sum is taken over the 
set of indices N{k,j) = {{k,j) ^ (0, 0), (1, 0)}. After some algebra we obtain the 
equation 



X 



^^{sx\ t)p{x\sx\ t) - (1 + ax{sx\ t)) (p'^isi, t)p{x\si; t)dsi 



(25) 

0. ^ ^ 



Note that for a = the "principal" solution of this equation satisfying the 
normalization condition is Xo{t,s) = Ao(t)¥'(s; t), where Ao(t) = {2a'^(t))~^. Our aim 
is to build the "perturbation" of this solution for a > 0. 

We introduce a new small functional parameter z/(t) = A(t) — Xo{t) and a new 
unknown function y{t, s) = x(t, s) — Xo(t){p{s] t), and substitute them into equation 
([25D. We have 

y{si;t) - iy{t)(p{si;t) + 

ENik.M) + Ut))'^'ci''+''^\k\j\)-'[^'^{^^^^ 

if'^is^; t)pi{y + Ao(t)v.(si; t)\s,- t)ds^] - Y^Nik^M^) + X,{t)f+^a^+^^m)-^ x 
{y{s2]t) + Xo{t)^{s2]t)) jl^^{si]t)p>{y{si,t) + Xo{t)^{si]t)\si,t)dsi = 0. 

(26) 

Due to the inequality 

Ao(t) = (2a5(t))-i < t e T, 

and the boundedness of the kernel, the series on the left-hand side are convergent se- 
ries of fc, j-linear operators (see [20j ) in y and u with bounded coefficients. Therefore 
the left-hand side of equation (!26i) is the analytic operator 



A{y, u, a) : B{I x T) x B{T) x B{I x T). 

The Frechet derivative Ay{y, 0,0) at the point y = is the unit operator and 
hence is bounded. Then Theorem 3.1 guarantees the existence of a solution of the 
form 

yisut) = J2 Ckjisuty{t)a\ (27) 

k+j>l 

where the series is absolutely convergent in the space B{I xT) for sufficiently small 
I I and a. This means the convergence of the power series with numerical coefficients 
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Ylk+j>i W'^kjW ■ ll'^ll'^'^"' for sufficiently small ||z/|| and a. Note that in the proof of 
Theorem 3.1 in [20j the majorant series for the solution was built explicitly. 

Now we substitute solution (!27l) into equation ( !26l) . Equating the coefficients at 
the same powers of u and a, we obtain the expressions for Ckj- For example, 

Cio(s; t) = ip{s] t), coi(s; t) = ^Xl{t){ip{s; t) - 3aJ(t)) + Ao(t)p(s; t), 
and so on. Returning to the function x, we have 

x(s; t) = \oit)ipis; t) + t)u''{t)a^. (28) 

fc+i>i 

Substituting this solution into normalization condition (^M), we obtain 

+ Ek+j>i E^+l>l lo lo Hsi, S2; tMs,; t)cu{s2; t)ds,ds2 a^^^v^^\t) = 1. (29) 

As Cio(s;t) = ip{s;t), the coefficient at uit) is equal to 2cr^(t) and is positive on T. 
Dividing by 2(T^()f:), we have the equation 

E 7;^/Wa'^'W, (30) 

fc>l,/>2 

where, as seen from (!29|) . the series with coefficients 7fc/(t) converges absolutely in 
some ball of the space B(T). 

Applying again Theorem 3.1 to equation (!29l) . we obtain the representation 

iy{t) = J2^pit)<''^ (31) 

p>i 

where the series converges absolutely in B{T) for sufficiently small a > 0. Substi- 
tuting ( 13T|) into f l28l) . we get again the convergent series. Returning to the extremal 
x{s, t) = \o(t)xo{s; t) +y(t), we substitute the new series for x into (l20i) . Integrating 
term-wise and using the convergence of the series for the solution, we obtain the 
expression for g{a,t) of the form 

oo 

g{a,m = J2(^^ity^t^T, (32) 

i=2 

where C2(t) = {8a'^(t))^^, and the series is convergent for sufficiently small positive 
a, so that the majorant series Yli^i Ikilk* is also convergent. 
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Now we can prove the continuity in a and other properties of the function 5fr(a|$) 
hsted in Theorem 2.1. Indeed, for any ai > 02 from the interval of convergence of 
the majorant series we have 

|^r(ai|$) - gT{a2\^)\ < ^^Vter I Y.t=2 Cfc(t)K - ^2)! 

^ I I Til II k—l 

< \ai - 02! 2^k=2 ^ llcfclFi • 

The series ^1^=2^ Ikfcllfli^^ has the same radius of convergence as the majorant 
series Xli^i hence the sum Yl'^=2^ I|ca:||oi~^ is bounded, and the continuity 

of gT{ci\^) follows. 

Now let us estimate the difference \gT{a\^) — a^/{8Lpl)\ for small a. Note that 
= inf g{t,a\(^) < a^/8(pl + sup | y^Cfc(t)a^|. 

On the other hand. 



a'. 



^r(a|$) = inf ^(t, a|<l>) > a^S^l - sup V(-Cfe(t)) 
Hence we obtain 

\gT{a\<i>) - aV(8v9^)| < EZsW^kWa' = a' EZslMa'"' = 0{a'), a ^ 0. 
For kernels of degree m > 2 in f|T2|) . the term 2m'^ipQ appears instead of Sip^. □ 



4 Upper bound. 

In this section, we bound the large deviation probabilities from above. For this we 
apply the exponential inequality for non-degenerate [/-statistics from [221 Theorem 
2.] For simplicity, we give here a slightly weaker version of it. 

Lemma 4.1 Under the conditions of Theorem 1.2 with cr^ > and z > we 
have 

where L := + (2/3)m-i) M. 
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Proof of Theorem 2.2. Consider the partition of the parametric set a = to < ti < 
■ ■ ■ < tj\f = b from the monotonicity condition with the nodes different from the zeros 
of the variance function cr^(t). On any interval of the form [tk, tk+i), k = 0, . . . , N—1, 
we have 

sup Un{t)< Un {tk+1 )+An{N). (33) 

tk<'t<ik+i 

Next, using for r^v > from condition f[T^ . 

P(sup, Unit) >a) < Zk=o n^^Pt,<t<t,^, Unit) >a)< 

< EkJ nUnitk+i) > a - r^) + NFiAniN) > r^) = Ti,^ + T^^n- 

Let us divide the sum Fi^tv into two parts: the first sum includes the indices k 
for which tk+i G T, while the second sum includes all remaining indices. For k = 
0, . . . , N — 2 the rv Unitk+i) is a [/-statistic with centred and bounded kernel $. By 
Theorem 1.2 we have for the summands of the first sum 

P([/n(tfc+i) > a-TN) = expi~ngia - tn ,tk+i) + oinj) < exp(-n5(r(a - tat) + o(n)). 

For the summands of the second sum, by Lemma 4.1 for tkJ^-i ^ T 

nia — Tjy)'^ 



fiUnitk+i) > a-rjv) < 4exp 
Therefore 



m?(fl + L(a - Tat) 



Ti,N < Nexpi-ngria - rjv|$) + o(n)) + 4iVexp ( ,\ ^ , ] . (34) 



nia — ttv)^ 
m'^ipl + L(a - Tat) 

Thanks to (fT6|l the term F2,Ar decreases faster than Fi^tv, and can be neglected. 
Taking the logarithms in the inequality dividing by n and passing to the limit 
as n ^ oo, we obtain 

limsupn""*^ lnP(sup f/n(t) > a) < — min ( g^ia — r^l^) , — —4 :rr^ r I • 

n^oo t V m^(p^ + Lia - tn)J 

By continuity of the function g^i- |$) as N —>■ oo 

limsup ?7.~MnP(sup f/n(t) > a) <— min ((7^(0!$), a^(m^v9Q + La)~^) . 

n^oo t 

But for small a, by ( fT2l) . gxia]^) < a^im^ipl + La)~^. Hence we obtain the required 
inequality 

limsupn"MnP(sup f/n(t) > a) < -^'^(al^')- □ 

n^oo t 

Theorem 2.3 follows immediately from Theorems 2.1 and 2.2. 
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5 Kolmogorov-Smirnov-type statistics 



In this section we prove that the Kolmogorov-Smirnov-type statistics DU^ and SU^ 
satisfy the monotonicity condition, so that Theorem 2.3 is apphcable to them. 

First, consider the statistic DU:^ and assume that the family of kernels 9(- ; t) 
is non-degenerate and 9q > 0. Let be a large number such that a — N^^ > 0. 
We divide the parametric set into N parts with nodes = G~^{k/N), k = 0, N, 
where G is from ([2]). If some interior node coincides with the zero of the variance 
function, we make a shift of order 0(A^^^). On any interval [tk, tfc+i), k = 0, N — 1 
we have 

sup (G„(t) - G{t)) < - + O(iV-i). 

tfe<i<ifc+i 

Hence the monotonicity condition f|T5|) holds if we take A„(A^) = 0(A^"^), and 
to be any sequence tending to zero slower than N~^. The same procedure is applied 
to the statistic DU~ . 

The arguments for SU:^ are similar. Take the nodes t^^N = F~^{j^)i k = 0, N, 
and shift them, if necessary, as above. Hence for A; = 0, . . . , — 1 we have 

sup [Gn{t) - Fn{t)] < Gn{tk+l) - F^{h+{) + - F„(tfc). 

The part of the quantity An(A^) in the monotonicity condition plays the rv 

n 

Fn{tk+i) - Fnih) = n-^^I{tk < Xj < tk+l}. 

i=i 

Obviously, the sum on the right-hand side has the binomial distribution with pa- 
rameters n and p = 1/N. The next lemma is proved in [11] . 

Lemma 5.1 Let Bin{n, 1/N) be the rv having binomial distribution with param- 
eters n and 1/N. Then for any r G (0, 1) the following inequality holds 

F{Bin{n, 1/N) > nr) < 4" exp( -nr In A^). 

We apply this Lemma with r = = (lnA^)~^/^. For sufficiently large 
P(Sm(n, 1/A^) > n/VhiN) < exp{-^nVlnN), 

and hence 

P(i^n(4+i) - Fnitk) > Tjv) < F{Bm{n, 1/N) > nr^) < exp{-^nVhJ^). (35) 
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From this the monotonicity condition follows. The arguments for SU^ are similar. 

We see that Theorem 2.2 is apphcable to statistics DU^ and SU^ if correspond- 
ing families of kernels are non-degenerate and centred. In this case 

limsup„^oo "-""^ lnP(D[/^ > 0.) < -gT{,ci\ ±0), 

limsup„^oon"MnP(S'f/^ > a) < -gT{a\ ±^). 

Together with f|T3l) and f|T4|) this implies 

lim„^oon~MnP(L)f/^ > a) = -^T(a| ± 6), 

\iuin^oon-^\n¥{SU^ > a) = -gria] ± ^). 

From these relations, as explained at the end of section 2, Theorem 2.4 follows. 

In order to illustrate the result on large deviations of the statistic DUn, assume 
for simplicity that the initial continuous d.f. F is defined on some finite or infinite in- 
terval [a, b] and is strictly monotonic there. Consider the kernel h{x, y) = max(x, y). 
Then EI{h{X,Y) < t} = F'^{t). It follows that the kernels corresponding to one- 
sided statistics Du!^"'^^ have the form 9(x, y; t) = /{max(x, y) < t} — F'^{t),t G R^. 
The projections of these kernels are 

e{x; t) = P(max(a;, Y) < t) - F\t) = I{x < t} F{t) - F\t), 

and the common variance function is 

a',{t)=Ee'{X;t) = F'{t){l-F{t)). 

The maximum of this function is 27/256. It now follows that for some continuous 
function vq 

32 

lim n-MnP(Dt/^'^^") > a) = vo{a) = — -a^ + O(a^), a 0. 

n— >oo 27 

6 Statistical applications 

In this section we apply our general theorems proved above to particular [/-empirical 
tests of Kolmogorov-Smirnov type. At the same time, we fill small gaps in the proofs 
of [ini [H] and [25], where the incorrect paper [23] was used. 

1. Test of exponentiality based on Desu's characterization. Let us return to 
the statistic DEn given in the Introduction. This statistic is scale-free, so we can 
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assume that the observations have standard exponential distribution. The kernel of 
the family of ^/-statistics DE^ takes the equivalent form 

^(x, y; t) = >t} + I{y > t}) - I{mm{x, y) > t/2}, x,y,t> 0. 

The projection is given by 

^{y; t) = i(P{Xi >t} + I{y > t}] - E[/{min(Xi, y) > t/2}] 
= \ exp(-t) + \l{y >t}- exp(-t/2) I{y > t/2}, 

and hence the variance function cr^(t) = |exp(— — exp(— > 0. So, the 
family of kernels is non-degenerate, and we can apply Theorem 2.4. It is seen that 
ipQ = sup^^Q a'^it) = j^, and this determines the first term of the large deviation 
asymptotics. So, there exists a continuous function vi such that 

lim n-^ lnF{DEn > a) = vi{a) = -2c? + O(a^), a 0. (36) 

n^oo 

2. Another test of exponentiality is based on the simplified "lack of memory" 
property, see [8] and [11]. Consider the statistic 

AAr+ = sup[F„(2x)-F„'(x)]. 

x'>S) 

Statistics AN~ and AN^ are defined analogously. Large values of these statistics 
are statistically significant. 

It was shown in [TT] that under the hypothesis of exponentiality the statistic 
AN^ admits the representation 

yliV+ = sup f ^"j V vl/(?7,,f/,;t) + 0(n-i), 
o<t<i i<f^<„ 

where Ui, ...,Un are uniformly distributed on [0,1] rv's. The family of kernels is then 
given by 

^(xi, X2; t) = I{xi <t} + I{X2 <t} - I{xi < t}I{x2 < t}~ 

(l/2)(/{xi <2t- t^} + I{x2 <2t- t^}). 



The corresponding family of [/-statistics satisfies the monotonicity condition |llj . 
Simple calculations show that 

ij{z;t) = E(^(Xi,X2;t)|Xi = z) = {l-t)I{z < t} + ty2 - {1/2)1 {z < 2t - t^} 
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and hence the variance function is equal to 



alit) = (l/4)t(l - tf{2 -t)>0, 0<t<l. 

We can apply Theorem 2.4. 

The maximum of the variance function is attained at t = 1 — 1/ and is equal 
to 1/16. Therefore for some continuous function V2 

lim n-^ lnP(AA/;+ > a) = V2{a) = -2c? + O(a^), a ^ 0. 

n— >oo 

The same asymptotics is valid for statistics AN~ and AN^. 

3. One more characterization of the exponential law belongs to Puri and Rubin 
[23] : Let X and Y be independent rv 's with common absolutely continuous df F on 
i?"*". F is exponential if and only if \X — Y\ has the same distribution as X. 

We construct a one-sided t/-empirical version of the Kolmogorov-Smirnov test 
by introducing the statistic 

pi?+ = sup n 2) im-x,\<t}-n-'j2nxk<t}y 

— \ i<i<j<ri k=l / 

Statistics PRn and PRn are defined analogously. Thus we obtain the family of 
[/-statistics with the kernels 

'^{xi,X2;t) = I{\xi -X2I <t}- ^(/{xi < t} + I{x2 < t}), t > 0. 

Hence 

t) = E (/{|Xi -s\<t}- |(/{Xi < t} + l{s < t})) = 

^'^'-l) I{s>t}-e-\e-^-^), 



and the variance function is equal to 



al{t) = ^e-\l + e-'-2e-''),t>0. 

The maximum of this function is attained for e~* = ^(^^ and is equal to ■ 
Hence there exists a continuous function ^3 such that as a — 

y in 

lim n-^\nF{PRn > a) = ^3(0) = — + O(a^) ^ -2.8400^ + O(a^). 

n— >oo 3 

The next two examples are related to the problem of testing symmetry. 
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4. Let Xi, X2, . . . , Xn be a random sample from a continuous df Fq . We wish 
to test the hypothesis of symmetry about zero 

Ho : AFo(x) := Fq{x) + Fo(-x) -1 = \/x eR\ 

Consider the centered (in time) Kolmogorov-Smirnov statistics [25]. One of the 
one-sided statistics has the form 



Hn = sup 



/oo 
iAFM)dF4y) 
-00 



the second statistic H~ is introduced in a similar manner, and Hn = max(if+, H~). 

Formally, these statistics do not belong to the class of statistics SUI^ and SUn, 
however the difference between them is nonsignificant. Without loss of generality 
we can assume that the distribution of Xi is uniform on [—1, 1]. For any t G [—1,1] 
consider the family of statistics 

/I n 
(F^t) + F4~t))dF4t) = n-^ Yl ^(^*' ^) + 

where the family of kernels \E'(- ;t) has the form 

^'(Xi, Xs; t) = ^ [H^I <t} + I{xi < -t} + I{X2 <t} + I{X2 < -t} 



-2-I{xi+X2 <0}-l^ 
Now we introduce the auxiliary family of statistics 



l<i<j<n 

Note that for any t and any n > 1 

\H^it)~Kit)\<7n~\ 

so the large deviation asymptotics for supjif.^(t) and supj7i^(t) are the same. It is 
easy to check that 

^{z; t)=E (^(Xi, X2; t) |Xi = z) = ^ {I{z <t} + I{z <-t}+t-l), 



and hence 



= - \t\ + 1), -l<t< 1. 



Clearly = sup_i<i<i a|(t) = 1/12. 

Let us turn to the upper bound and consider the uniform partition of [—1, 1] into 
2N parts using the nodes tk = k/N, k = —N, ...,N. Obviously, 

sup n^t) < Kitk+l) + Fn{tk+l) - Fn{tk). 

*fc<t<tfc+i 

By Lemma 5.1 this ensures the monotonicity condition. Therefore for some contin- 
uous function t>4 as a ^ 

3 

lim n'^ lnP(//+ > a) = Vi{a) = --a^ + O(a^). 

n— »oo 2 

Similar results hold for H~ and iJ„. 

5. Another test of symmetry is based on the characterization established by 
Baringhaus and Henze [9]: The common distribution of two independent rv's X 
and Y is symmetric with respect to zero iff \X\ and |max(X, F)| have the same 
distribution. 

Consider two edf's based on the sample Xi, Let 

n 

Ln{x) =n'^^I{\Xj\ <x),x>0 
i=i 

and ^ 

Gn{x) = r ) A|max(X„Xfe)| <x),x> 0. 

^ ^ l<j<k<n 

Following [9], consider the statistic 

5if+ = sup[L„(x)-G„(x)]. 

The statistics BH:^^ and BHn are defined analogously. All these statistics are 
distribution-free, and we may assume that the observations are uniformly distributed 
on [—1, 1]. The statistic BH^ admits the representation 



M+=f^) sup ^(X^^X'^-^t) 



where for any t G (0, 1) 



E{X„Xk;t) = /{|max(X„Xfc)| < t} - ^{I{\X,\ <t} + /{|X,| < t}). 
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Formally this family does not fit our theory, however replacing edf F„ by edf 
Ln leads to minimal changes in the proofs. Simple calculations show that for any 
z G [—1, 1] and < t < 1 the projections of the kernels have the form 

-l<z<-t 

i{z-t) = E{^{Xi,X2]t)\Xi = z)= { 0, -t<z<t 

t < z <1. 

Consequently the variance function is given by 




ait) 



1 

/ (/{-I <x<-t}-I{t<x< l})^dx = -t\l - \t\), -l<t<l. 
J-i 4 



The maximum of this function is attained at ±| and is equal to ^. Hence the large 
deviation asymptotics has the form 

27 

lim n-^ \nF{BH+ > a) = v^ia) = + 0{a^), a ^ 0. 

n—*oo 8 

Similar statements hold true for the statistics BH^ and BHn- 

6. Consider the famous characterization of normality due to G. Polya ^26j|: Let 
X and Y he i.i.d. rv's with zero mean. Then X and (X + Y)/\/2 have the same 
distribution iff X and Y are normally distributed with some positive variance. 

The integral test of normality based on this characterization was proposed in 
[27]. Let us construct the scale-invariant Kolmogorov-type test comparing the usual 
edf and the [/-empirical df, based on . We arrive at the one-sided statistic 

P0+ = sup ( ^{^^ + < - E ^^^'^ < ^> ) ■ 

* \^ l<i<j<n k=l / 

The statistics PO^ and P0„ are introduced similarly. Our statistic PO^ corre- 
sponds to the family of kernels 

^(xi, X2; t) = I{xi + X2< tV2} - <t} + I{X2 < t}) 

and satisfies the monotonicity condition. 

Denote by A/" the df of the standard normal law. Then the projections of the 
kernels are 

iP{s;t) = E(^(Xi,X2;t)|X2 = s) = Ar{tV2 - s) - ^^/{t) - ^l{s < t}. 
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Consequently, the variance function is 

/oo j-t 11 

M\tV2 - s)dM{s) - / N{tV2 - s)dM{s) + -Nit) - -N\t). 
oo J —oo 

The problem of finding the maximum of this function analytically is difficult. 
However, its plot clearly shows that this maximum is attained at zero, and conse- 
quently is equal to 1/48. 
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Fig.l Plot of the variance function for Polya test. 



Therefore there exists a continuous function vq such that 

lim n'^ lnP(PO„ > a) = ve{a) = -6a^ + O(a^), a ^ 0. 

n^oo 



7 Calculation of local Bahadur efficiency 

The results on large deviations allow us to calculate local Bahadur efficiency of 
[/-empirical Kolmogorov-Smirnov tests. 

As an illustration we find the efficiency of Desu's test for some parametric alter- 
natives F0 to the hypothesis of exponentiality. The local exact slope, see [3j, is the 
main part as 6' —> of the expression 

4sup|(l-F,(x/2))2-(l-F9(x))|^ 

X 
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Take, for example, the WeibuU alternative with 1 — Fg{x) = exp(— > 0. 
Then the local exact slope is equivalent to 

41n2(2)sup(a;exp(-a;))2 9^ ^ 0.2601 9'^. 

X 

The theoretical maximum (double KuUback-Leibler information, see ^), is equal 
[TT] to 7r^/6 ■9'^ ^ 1.6449 9^. Hence the local Bahadur efficiency of Desu's test equals 
0.1581. At the same time the efficiency of Desu's test for Makeham alternative with 
the density (1 + 9{1 — e~^)) exp (— x — 6'[x — (1 — e""^)]), x > 0, 6* > 0, is much larger 
and is equal to 0.4938. 

Other calculations of local Bahadur efficiency can be found in [25], [ID] and |llj . 
It turns out that in some cases [/-empirical Kolmogorov tests have high efficiency and 
perform well compared to some other goodness-of-fit tests. For instance, the local 
efficiency of the sequence of statistics iJ„ for testing symmetry under the normal 
shift alternative is equal to 0.955. In the same problem the local efficiency of the 
sequence of statistics BHn is equal to 0.75. 

It would be interesting to construct new [/-empirical tests of Kolmogorov- Smirnov 
tj^e and to calculate their efficiencies using the large deviation results obtained in 
this paper. We hope to return to this question later. 
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